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ABSTRACT 

Statistical observations of the Epoch of Reionization using the 21 cm line of neutral hydrogen have 
the potential to revolutionize our understanding of structure formation and the first luminous ob- 
jects. However, these observations are complicated by a host of strong foreground sources. Several 
foreground removal techniques have been proposed in the literature, and it has been assumed that 
these would be used in combination to reveal the Epoch of Reionization (EOR) signal. By studying 
the characteristic subtraction errors of the proposed foreground removal techniques, we identify an 
additional subtraction stage that can further reduce the EOR foreground contamination, and study 
the interactions between the foreground removal algorithms. This enables us to outline a compre- 
hensive foreground removal strategy that incorporates all previously proposed subtraction techniques. 
Using this foreground removal framework and the characteristic subtraction errors, we discuss the 
complementarity of different foreground removal techniques and the implications for array design and 
the analysis of EOR data. 
Subject headings: 



1. INTRODUCTION 

Highly redshifted 21 cm neutral hydrogen emission 
from the Epoch of Reionization (EOR) is a unique cos- 
mological probe, and observations with planned low fre- 
quency radio telescopes could revolutionize our under- 
standing of galaxy and structure formation and the emer- 
gence of the first luminous objects. 

The p otential of 21 cm obs e rvatio ns was first recog- 
nized by iSu^a^v^^Zeldovicbl (1 1 Q 721) and further devel- 
oped b y iScott fc Reesl (Il990f>: iMadau Meiksin fe Reel 
l|1997lk iTozzi et alJ (|200(t ) : llliev et alJ (|2002fl . There are 
five possible experimental signatures produced by neu- 
tral hydrogen during the EOR which can be targeted 
with low fre quency radio obser vations: the global fre- 
ouen cv step l| ghaver_e±_alJllfl9Sl) , d i rect imaging 1 HI for - 
rest (jCarilli. Gnedin fc Owenll20n2HCarilli et al.ll2l)04bT) . 
Stro mgren sphere mappin g, and statistical observations 
(see ICarilli et alJ l|2004a|) for overview). Of the five 
experiment al signatures, the static a l observations de- 
velope d by iZaldarri aea et a l.l (120041) . [ Morales fc Hewitt! 
(|2004f) . and iBharadwai fc Alii l)2005|) offer the most 
promise and cosmological power, and are being targeted 
by the Mileura Widefield Array (MWA), the LOw Fre- 
quency ARray (LOFAR), and the Chinese Meter Array 
(21CMA, formerly PAST). 

Unlike the Cosmic Microwave Background (CMB) 
emission, which is inherently two dimensional (sky po- 
sition), the EOR data is three dimensional because the 
rcdshift of the observed neutral hydrogen emission maps 
to the line-of-sight distance. This allows us to extend 
the statistical techniques developed for the CMB to three 
dimensions, and use power spectrum statistics to study 
the EOR. These statistical analysis techniques dramati- 
cally increase the sensitivity of first generation EOR ob- 
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servations, and allow much more detailed studies of the 
cosmology lIBowman et all 120061 iFurlanetto et al J 120041 
IMoralesl l2005') . The major remaining question of EOR 
observations is whether the foreground contamination — 
which is ^5 orders of magnitude brighter than the neu- 
tral hydrogen radio emission — can be removed with the 
precision needed to reveal the underlying EOR signal. 

Several foreground subtraction techniques have been 
suggested in the literature. Bright sources can be identi- 
fied and removed as in CMB and galaxy clustering anal- 
yses, but the faint emission of sources below the detec- 
tion tlrresjio2d_wfil^tin_ov the weak EOR sig- 
nal (jDi Matte o et al.ll2002l) . Additional foreground con- 
tamination can be removed by fitting a spectral model 
to each pixel in the sky to remove t he many faint con- 
tinuum sources in each line of sight (Br iggs et aLlfeOOH 
Wang et alJ 1200 4) . A similar technique proposed by 
Zaldarriaga et al. (2004) moves to the visibility space 
(2D FT of image-frequency cube to obtain wavenum- 
bers in sky coordinates and frequency along the third 
axis) and then fits smooth spectral models for each 
visibility ( San tos et alJ l2005j) . This method should be 
better at removing emission on larger angular scales, 
such as continuum emis sion from our own Milky Way. 
iMorales fc Hewitt! l)2004j) introduced a subtraction tech- 
nique which exploits the difference between the spherical 
symmetry of the EOR power spectrum and separable- 
axial symmetry of the foregrounds in the three dimen- 
sional Fourier space, and is particularly well suited for 
removing very faint contaminants. 

Foreground removal has been envisioned as a multi- 
staged process in which increasingly faint contaminants 
are subtracted in a stepwise fashion. By studying the 
errors made by proposed foreground subtraction algo- 
rithms, we identify an additional subtraction stage where 
the average fitting errors of the proposed algorithms are 
subtracted from the three dimensional power spectrum. 
This residual error subtraction step can significantly re- 
duce the residual foreground contamination of the EOR 
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signal, and differs from CMB techniques by relying on 
the statistics of the errors and separating the residual 
contamination from the power spectrum instead of the 
image. 

Because the residual error subtraction relies on the 
statistical characteristics of the subtraction errors, the 
foreground removal steps become tightly linked and we 
must move from focusing on individual subtraction algo- 
rithms to the context of a complete foreground removal 
framework. This paper outlines a comprehensive fore- 
ground removal strategy that incorporates all previously 
proposed subtraction techniques and introduces the new 
residual error subtraction stage. Treating the foreground 
removal process as a complete system also allows us to 
study the interactions (covariance) of the subtraction al- 
gorithms and identify the types of foreground contami- 
nation each algorithm is best suited for. 

Section |2] reviews the properties of the data produced 
by the first generation EOR observatories. We then in- 
troduce the foreground removal framework in Section [3] 
along with several detailed examples of the subtraction 
errors. Sections 0] and then discuss the implications 
of the foreground removal framework and how it can be 
used to improve the design of EOR observatories and 
foreground removal algorithms. 

2. EXPERIMENTAL DATA 

While there are important differences in the data 
processing requirements of the MWA, LOFAR, and 
21 CM A — and their data analysis systems are rapidly 
evolving — all three experiments follow the same basic 
data reduction strategy. 

All three observatories are composed of thousands of 
simple detection elements (dual polarization dipoles for 
MWA and LOFAR, and single-polarization YAGIs for 
21CMA). The signals of the individual detecting ele- 
ments are then combined with analog and digital systems 
into "antennas" of tens to hundreds of elements, which 
are then cross-correlated to produce the visibilities of ra- 
dio astronomy. These visibilities are the basic observable, 
and are the spatial Fourier transform of the brightness 
distribution on the sky at each frequency. The visibili- 
ties from each experiment — up to 4 billion per second in 
the case of the MWA — must then be calibrated and inte- 
grated to form one long exposure. The final data prod- 
uct is a visibility cube representing a few hundred hours 
of observation, which can be either Fourier transformed 
along the angular dimensions to produce an image cube 
for mapping, or along the frequency axis to produce the 
Fourier representation for the power spectrum analysis. 

Going from the raw visibilities produced by the corre- 
lators to visibility cubes representing hundreds of hours 
of observation is a Herculean task, and we do not wish 
to minimize the effort involved in this stage of the pro- 
cessing. The ionospheric distortion must be corrected 
using radio adaptive optics, and the time variable gain, 
phase, and polarization of each antenna must be pre- 
cisely calibrated. Going from the raw visibilities to the 
long integration visibility cube tests and displays the art 
of experimental radio astronomy. However, in the end 
all three experiments will produce the same basic data 
product, and for our purposes we will concentrate on how 
to process this long integration visibility cube to remove 
the astrophysical foregrounds and reveal the cosmologi- 



cal EOR signal. 

3. FOREGROUND SUBTRACTION FRAMEWORK 

Fundamentally, all the proposed foreground subtrac- 
tion techniques exploit symmetry differences between the 
foregrounds and the EOR signal, and are targeted at 
removing different types of foreground contamination. 
Because the EOR signal is created by a redshifted line 
source, the observed frequency can be mapped to the 
line-of-sight distance. This produces a cube of space 
where we can observe the HI intensity as a function of po- 
sition. The EOR emission appears as bumps along both 
the frequency and angular directions, and since space 
is isotropic (rotationally invariant) leads to a spherical 
symmetry in the Fourie r space (the 3D Fourier Trans- 
form of the image cube ijMorales fc Hewittil2004|n . This 
contrasts with most of the foreground sources which ei- 
ther have continuum emission which is very smooth in 
the frequency direction, such as synchrotron radiation, 
or emission line radiation which is not redshifted and 
thus at set frequencies, such as radio recombination lines 
from the Milky Way. The foreground removal techniques 
all use the difference between the clumpy-in-all-directions 
EOR signal and the foregrounds which are smooth in at 
least one of the dimensions. 

The foreground subtraction framework can be divided 
into three stages — bright source removal, spectral fit- 
ting, and residual error subtraction — where each step 
subtracts increasingly faint foreground contamination. 
The first two steps utilize well developed radio analy- 
sis techniques and have been previously proposed. The 
residual error subtraction stage extends the symmetry 
ideas of iMorales fc Hewitt! l)2004|) to identifying and re- 
moving the average fitting error of the first two stages. 
In Sections I3.1H3.3I we s tep through the three subtrac- 
tion stages. Section 13.41 then shows several examples of 
how to calculate the characteristics of the subtraction er- 
rors, and Section f3 . 51 discusses how the subtraction errors 
affect the EOR sensitivity. 

One subtlety that arises is which sources should be 
considered foreground in an interferometric observation. 
Diffuse synchrotron emission from our own galaxy is the 
single brightest source of radio emission at EOR frequen- 
cies. The brightness temperature of the synchrotron 
emission towards the Galactic poles is several hundred 
degrees, and dominates the thermal noise of the telescope 
and receiver system. However, the diffuse Galactic syn- 
chrotron emission is so spatially and spectrally smooth it 
is not customarily included in discussions of foreground 
subtraction — the majority of this nearly DC emission is 
resolved out by interferometric observations. Instead the 
Galactic synchrotron contribution is included in sensitiv- 
ity calculations as th e dominant source of system noise 
( Bow man et al.ll2006j) . Simila rly, polarized emission can 
become a major foreground l)Haverkorn et al"ll2f)03[) by 
leaking into the intensity maps. However, since the con- 
tamination of the intensity map is through errors in the 
polarization calibration, this is customarily considered a 
calibration issue and not foreground subtraction. Follow- 
ing these conventions here, we focus on removing the un- 
polarized contributions from resolved foreground sources. 

3.1. Bright Source Removal 
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Fig. 1. — The left panel shows the spherically symmetric power spectrum of the EOR signal (zero is in the center of the lower face), while 
panel (b) shows the separable-axial power spectrum template typical of the residual foregrounds. The power spectrum shapes are known, 
and the amplitudes can be fit in the residual error subtraction stage to separate the residual foreground subtraction errors from the faint 
EOR signal. 



In the first stage the bright contaminating sources, 
both astrophysical and man made, are removed. Be- 
cause the spatial and frequency response of an array is 
not a delta-function, emission from a bright source will 
spill over into neighboring pixels and frequency chan- 
nels. The goal of the first foreground removal stage is 
to subtract the contributions from all sources which can 
contaminate distant locations in the image cube. 

The worst of the radio frequency interference (RFI) 
will be cut out of the data prior to forming the long 
integration visibility cube. What will remain is a sea of 
faint transmissions. The easiest way to remove narrow- 
band transmissions is to identify the affected channels 
(elevated rms) and excise them. Modern polyphase filters 
have very high dynamic range, so only a few channels 
will need to be removed for all but the very brightest 
transmitters. This leads to slices of missing frequency 
information and complicates the experimental window 
function, but is very effective if the dynamic range of the 
polyphase filters is high enough. 

Removing astrophysical sources is conceptually simi- 
lar, but is more difficult due to the lower spatial dynamic 
range of most radio arrays. Emission from a bright astro- 
physical source will leak into pixels far from the source 
position due to the imperfect point spread function of 
an array. Thus we need to use the traditional radio as- 
tronomy subtraction technique of removing the sources 
directly from the visibilities to subtract the array side- 
lobes along with the central emission. 

The signal strength of the EOR is a few mK, so the 
astrophysical and RFI sources must be subtracted until 
the sidelobes are <mK. This places strong constraints on 
the spatial and frequency dynamic range of an array, as 
well as the RFI environment. Unfortunately, even the 
faint emission of galaxies below the de tection threshold 
will o verwhelm the weak EOR signal l)Di Matteo et alJ 
2002), and we must resort to more powerful subtraction 
techniques to reveal the EOR signal. 

3.2. Spectral Fitting 

At the end of the bright source foreground removal 
stage, all sources bright enough to corrupt distant areas 
of the image cube have been removed, and we are left 
with a cube where all of the contamination is local. Here 
we can use foreground subtraction techniques which tar- 
get the frequency characteristics of the foreground emis- 
sion. 



In every pixel of the image cube, there will be contri- 
butions from many faint radio galaxies. The spectrum 
within one pixel is well approximated by a power-law, 
and can be fit and removed. Since the EOR signal is 
bumpy, fitting smooth power law models nicely removes 
the foreground cont ribution while leav ing most of the 
cosmological signal ijBriggs et all I2005D . There are a 
number of subtle effects which must be carefully mon- 
itored, such as changing pixel size, but this is an effec- 
tive way of removing the contributions of the faint radio 
galaxy foreground. 

A si milar method was proposed by Zaldar riaea et alJ 
( 2004) , where smooth spectral models are fit to individ- 
ual spatial frequency pixels in the visibility space. While 
there is a lot of overlap between this method and image 
method, the visibility foreground subtraction technique 
should be superior for more extended objects such as 
fluctuations in the Milky Way synchrotron emission. 

The last type of spectral fitting is to remove radio re- 
combination lines from our own Galaxy. The intensity of 
these lines is uncertain, but since they occur at known 
frequencies, template spectra can be used to subtract 
them. Unlike the smooth power-law spectra, the struc- 
ture in the recombination line spectrum has much more 
power on small scales (line-of-sight redshift distance). 
More work is needed to accurately determine the strength 
of this foreground and develop template spectra. 

The errors made in the spectral fitting stage can be 
classified into two types: model errors due to foreground 
spectra which cannot be fit exactly by the model param- 
eters, and statistical errors due to slight misestimates of 
the model parameters in the presence of thermal noise. 
These errors are discussed at length in Section EP1 

3.3. Residual Error Subtraction 

While the vast majority of the foreground contamina- 
tion will be removed in the first two analysis stages, resid- 
ual foreground contamination will remain due to errors 
in the subtraction process. In the absence of foregrounds 
the EOR power spectrum could be measured by dividing 
the individual power measurements in the Fourier space 
into spherical annuli, and averaging the values within 
each shell to produce a single power spectrum measure - 
ment at the given length scale ijMorales fc Hewitt1 f2004) . 
This reduces the billions of individual power measure- 
ments down to of order ten statistical measurements, and 
is behind the extraordinary sensitivity of cosmological 
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power spectrum measurements l)Bowman eta0[2006). 

However, the first two stages in the foreground subtrac- 
tion are not perfect. For example, in the bright source 
removal stage the flux of each source will be slightly mis- 
estimated, leading to faint residual positive and nega- 
tive sources at the locations of the subtracted sources. 
These faint residual sources inject spurious power to the 
three dimensional power spectrum. It is impossible to 
determine what these subtraction errors are individually 
(otherwise we would improve them), however, we can 
predict, measure, and remove the average effects of this 
residual foreground contamination from the power spec- 
trum. Since the power spectrum is related to the square 
of the intensity, residual positive and negative sources 
have the same power spectrum signature, and the ampli- 
tude of the residual power spectrum signal is related to 
the standard deviation of the subtraction errors made 
in the first two stages. Different types of foreground 
subtraction errors produce distinct shapes in the three 
dimensional power spectrum, and are easily differenti- 
ated from the a pproximate spherical sym metry of the 
EOR signal ( see iMorales fc Hewittl l|2004|) and Section 
I3.4JI . Figure H shows the three dimensional power spec- 
trum shapes typical of the signal and residual foreground 
components. 

So in the presence of foregrounds our final stage of the 
analysis becomes a multi-parameter fit, with each com- 
ponent of the residual foreground and the EOR signal 
being represented by a corresponding 3D power spec- 
trum template and amplitude. The measurements are 
then decomposed into template amplitudes to separate 
the EOR signal from the residual contamination from 
foreground subtraction errors in the first two stages. In 
effect this final subtraction stage allows us to not only fit 
the local foreground parameters (position, spectra, etc.) 
as in the first two stages, but to also fit the width of the 
subtraction errors. 

The errors produced by the first two foreground sub- 
traction stages depend on the details of both the al- 
gorithm and the array. For example, errors made in 
the spectral fitting stage depend on both the spectral 
model used (qudratic, power-law, etc.) and how the 
pixel shape varies with frequency (array design). This 
precludes defining a set of residual error templates that 
is generally applicable, but calculating the templates for 
a specific analysis is straightforward, as demonstrated in 
the following section. 

3.4. Example Subtraction Error Templates 

To separate the residual foreground and cosmological 
signals in the residual error subtraction stage of the EOR 
analysis, we need to predict the shape of the residual fore- 
ground contamination as seen in the three dimensional 
power spectrum. The first two stages of foreground sub- 
traction remove the majority of the contamination, so 
what we see in the residual error subtraction stage is not 
the original power spectrum shape of the foregrounds, 
but instead the shape of the errors characteristic of the 
first two foreground removal stages. In the following sub- 
sections we provide examples of how to calculate the 
residual error templates, and discuss the characteristic 
power spectrum shapes. 

3.4.1. Statistical Spectral Fitting Errors 




frequency 

Fig. 2. — This cartoon shows the true foreground continuum 
spectrum observed in one pixel as a black line, and the error in 
fitting parameter b due to thermal noise. The inset shows the 
expected Gaussian profile of Ab = (bj< — b) and the width of the 
distribution 05. 



In the spectral fitting foreground subtraction stage, a 
smooth spectral model is fit to each pixel to remove the 
contributions of faint continuum sources. However, due 
to the presence of thermal noise the fit spectrum is not 
exactly the same as the true foreground. These slight 
misestimates of the foreground spectra in each pixel pro- 
duce a characteristic power spectrum component. The 
exact shape of this power spectrum template of course 
depends on the spectral model one chooses. Over the 
relatively modest bandwidths of proposed EOR measure- 
ments the foreground emission is reasonably well mod- 
eled by a quadratic spectrum liBri ggs et al.l |2005l). and 
as an illustrative example we demonstrate how to calcu- 
late the power spectrum template in the case of a simple 
quadratic spectral model. 

For a quadratic foreground subtraction algorithm the 
residual foreground contamination is given by: 



AS(f) = Aa df 2 + Abdf + Ac, 



(1) 



where df is the difference between the observed frequency 
and the center of the band, and Aa, Ab, Ac represent 
the difference between the true parameter value for the 
foreground and the fit value. Figure [21 depicts errors in 
fitting parameter b for one pixel. Moving to the line- 
of-sight wavenumber n with a Finite Fourier Transform 



gives 



AS(9 x ,9 y ,n) 



AaB iAbB 



+ AcS k (v), (2) 



irrj 7T7/ 

where B is the bandwidth of the observation, and we 
have explicitly shown that this for a particular line of 
sight 9 x ,9 y . 3 

To compare with the three dimensional EOR power 
spectrum we need to move from the angular coordinates 

3 Equation |5j assumes the function is sampled at the center of 
each bin, whereas in most measurements the value is integrated 
over the bin. The two definitions converge as Aa — > 0, so we use 
Equation [2] as a very good approximation. 
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9 x ,6 y to the spatial frequencies u,v. Even though the 
foreground is spatially clustered, we expect the fitting 
errors to be distributed like white noise. This allows us 
to calculate the Fourier transform for the zero frequency 
term and generalize to the other values of u and v: 



n [Hz 1 ] 



,x10 10 



e 

AS(u, v,r)) = J2 AS (°x> v> v)dO x dd v 

where O is the field of view and d6 x d6 y = dil is the angu- 
lar resolution of the array (measured in steradians). The 
root mean square of the sum is given by the square root 
of the number of independent lines of sight (y/Q/dQ,) 
times dfl 



(3) 



AS(u,v,r)) rms = AS{9 X , 9 y , rj) rms y/ QdQ. (4) 

If the errors in the fitting parameters are Gaussian dis- 
tributed, the average power spectrum will be 



(P s {r])) = 2edilB 2 
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7T 2 ?7 2 



(5) 



where o~ a and o~b are the standard deviations of Aa and 
Ab respectively, and the term cv has been re-defined 
to include all the contributions proportional to the Kro- 
necker delta function 8 k (rj). For each visibility u,v, the 
power spectrum will be exponentially distributed around 
the average, but will become Gaussian distributed when 
many lines-of-sight are averaged together. 

Equation [S] gives the average power spectrum due to 
statistical fitting errors with a simple quadratic spectral 
model. A plot of this fitting error power spectrum tem- 
plate along the line-of-sight is shown in Figure [3J For 
pixel-based fitting algorithms the thermal noise is nearly 
constant for all pixels, and thus the magnitude of the 
residual power spectrum is nearly equal for all visibil- 
ities. This provides a particularly simple three dimen- 
sional power spectrum template that falls as a high power 
of rj and is constant for all visibilities, and can be visu- 
alized as a plane of power near -q = that quickly falls 
away (see the right hand panel of Figure ^ f° r the ba- 
sic shape). For visibility-base d subtraction algorithm s 
tuned for larger scale structure IjZaldarri aga et aLl l2004). 
the thermal noise at large angular scales is much lower 
due to the central condensation of realistic arrays, so the 
power will be concentrated near the ry = plane but 
reduced in amplitude at small visibilities. 

The a values in Equation represent the standard de- 
viation of the quadratic spectral fitting algorithm and 
determine the amplitude of the residual subtraction er- 
rors to the signal. The a represent an ensemble statistic, 
as compared to the individual fits to each pixel made 
by the spectral fitting algorithm. In the full analysis we 
fit local parameters for each pixel/ visibility during the 
spectral fitting stage, and then we fit the errors (<r) in 
the residual foreground subtraction stage. This pattern 
of fitting local parameters in the first two stages, and 
the statistical distribution of their errors in the third is 
repeated for each type of error. 

The example power spectrum template presented here 
only applies to a simple quadratic spectral model, but 
similar results can be obtained if power-law or other fore- 
ground models are used instead. They key is to deter- 
mine the shape of the power spectrum produced by the 
local spectral fitting errors, which can then be fit globally 
in the residual error subtraction stage of the analysis. 
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Fig. 3. — This figure shows the shape of the power spectrum 
contributions from the EOR signal and the statistical spectral fit 
residuals in observer's units (top panel) and theorist's units (bot- 
tom panel) along the line-of-sight direction (77 or Ay). The observed 
EOR signal is shown as a solid black line, and the linear (oc rj~ 2 ) 
and quadratic (oc rf~ 4 ) residuals are shown as thick and thin grey 
lines respectively. The vertical grey line shows the edge of the rj = 
bin for a 16 MHz bandwidth. All power spectrum measurements 
to the left of the vertical line are purely angular while the measure- 
ments to the right of the line use both the angular and frequency 
information. The dashed lines show the binning effects for the lin- 
ear and quadratic components of the residual foreground, with the 
dash-dot line showing the 5 fc -function contribution from the off- 
set term. The amplitudes of the residual foreground components 
depend only on the standard deviations of the fitting parameter 
errors (a a , ff, , a c i ) and are fit using parameter estimation in the 
residual subtraction stage of the analysis. 



3.4.2. Spectral Model Errors 

In addition to the statistical fitting errors discussed in 
the previous subsection, there is a class of model errors 
which can be made in the spectral fitting stage. Sim- 
ple foreground spectral models may be unable to fit the 
underlying foreground spectrum. Even if all the source 
spectra were perfect power laws, there are many sources 
per pixel, which can lead to a complex cumulative spec- 
trum which cannot be exactly fit by a simple spectral 
model. Figure 0] shows the origin of the model errors. 
Because the summation of sources and spectra that forms 
the cumulative spectrum in each pixel is a statistical pro- 
cess, we expect the model errors to be statistical. This 
allows one to follow the same process we used in the 
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Fig. 4. — This cartoon shows the difference between the spec- 
trum in a pixel (solid line) and the best fit model in the absence 
of measurement noise (dashed line). Because the model cannot ex- 
actly follow the true spectrum, an additional component is added 
to the power spectrum which can be estimated and removed in the 
residual error subtraction stage of the analysis. 



previous section and form an expected power spectrum 
shape due to model errors which can be fit in the resid- 
ual error subtraction stage of the analysis. This model 
error template will depend sensitively on both the cho- 
sen spectral model and the imaging characteristics of the 
a rray. 

Brie gs et alJ l)2005|) showed that using realistic bright- 
ness counts dN/dS and spectral slopes, the cumulative 
spectrum is typically dominated by the brightest source 
in the pixel, and even for a simple quadratic spectral fit 
the model error is expected to be less than the cosmo- 
logical EOR signal. Thus we expect the power added 
by model errors to be quite small. But by including the 
model error power spectrum in the parameter fit, we can 
eliminate any bias that could be introduced into the EOR 
signal. 

3.4.3. Bright Source Subtraction Errors 

Astrophysical sources which are bright enough to con- 
taminate distant areas of the image cube are removed in 
the bright source subtraction stage of the analysis (Sec- 
tion ^. II) . The errors in this foreground removal stage are 
primarily due to misestimates of the source intensities. 
The bright source subtraction errors can be envisioned as 
residual positive and negative sources at the locations of 
the subtracted sources, and will produce a distinct fore- 
ground signature in the three dimensional power spec- 
trum. 

The accuracy of determining the fluxes of foreground 
sources depends on their brightness — the strongest 
sources may be subtracted to a few parts per billion while 
faint sources are only subtracted to a few tenths of a per- 
cent accuracy. Thus the residual sources mirror the po- 
sitions of subtracted foreground sources and have a zero 
mean Gaussian distribution of flux, but with a greatly 
reduced dynamic range (the standard deviation of the 
subtraction error a is not proportional to the brightness 



S) . This produces a map of the expected residual sources 
which can be convolved with the point spread function of 
the array and Fourier transformed and squared to create 
the expected three dimensional power spectrum template 
for the errors introduced by the bright source subtraction 
stage of the analysis. 

The source subtraction error template will depend on 
the particular field observed and spectral model (see Sec- 
tion but will be strongly peaked at low rj (primary 
error leads to S k (r])) and concentrated at small visibili- 
ties due to angular clus tering of the foreground sources 
lIDi Matteo et al J 12001 . The compression of the dy- 
namic range must be either modeled, or fit with an addi- 
tional parameter in the residual error subtraction stage 
of the analysis. An example power spectrum template 
is shown in Figure El using sources from the Westerbork 
Northern Sky Survey (WENSS). As can be seen from 
the example, the errors made by the bright source sub- 
traction stage add power, particularly at small u, v and 
?7 = 0. A similar procedure can be used for RFI which 
spills into neighboring frequency channels, and depends 
on the spectral dynamic range of the array and the RFI 
removal procedure. 

3.5. Uncertainty Calculations 

In the final residual foreground subtraction stage, pa- 
rameter estimation is used to separate the EOR signal 
from the residual foreground contaminants, using the 
characteristic power spectrum templates calculated in 
Section EP1 We are left with a measurement of the EOR 
signal strength (typically in ranges of k) and the resid- 
ual foreground amplitudes. The uncertainty of the EOR 
signal determination depends on two factors: the uncer- 
tainty in the residual foreground amplitude determina- 
tions, and their covariance with the EOR signal. 

Calculating the uncertainty of the foreground ampli- 
tudes (A x oc a x ) is complicated by their additional sta- 
tistical correlations as compared to the EOR signal. The 
observed Fourier space has a fundamental correlation im- 
printed by the field-of-view and bandwidth of the obser- 
vation. This can be used to define a natural cell-size 
for the data where the observed EOR signal in each cell 
is lar gely independent ijMorales fc Hewiti 2004: M orales I 
12005ft . The observed data can then be represented as a 
vector of length N u x N v x N n , where N is the number 
of cells in the u, v and rj dimensions respectively. For the 
EOR signal vector s, all of the cells are nearly indepen- 
dent and (ss) w si where I is the identity matrix. 

This is not true for most of the residual foreground 
templates. The residual foreground contributions are of- 
ten highly correlated between Fourier cells ((ff) ^ fl), 
and so average differently than the EOR signal. For ex- 
ample, the linear and quadratic spectral fitting errors (o~ a 
and <7b terms) each imprint a specific residual in all the 
7/ channels for a given u, v pixel — the amplitude of the 
residual will vary from one u, v pixel to the next, but 
are perfectly correlated (deterministic) for the r\ values 
within one pixel. 

When averaging over many Fourier cells and lines-of- 
sight, the uncertainty in the amplitude of any component 
becomes approximately Gaussian distributed and equal 
to 

AA X ^A T X -A X ^ ( 6 ) 
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Fig. 5. — This figure is an example of the expected u, v power spectrum template due to bright source subtraction errors. In this example 
14,510 sources from the Westerbork Northern Sky Survey (WENSS) at 325 MHz IRengelink et alj[T997l) were chosen in a field centered at 
90 deg RA, 60 deg DEC, with a 31 degree FOV (diameter). To model the compression of the dynamic range, the variance of the source 
subtraction error was chosen to be proportional to the log of the flux, and the color map is linear with an arbitrary scale. Note the increased 
amplitude towards the center due to the angular galaxy correlation. The exact power spectrum template will depend on the chosen field 
and the source identification algorithm, but will have this basic shape in the u, v plane and will be highly concentrated towards r] = 0. 



where A x is the amplitude of the x component of the 
signal or residual foreground contribution, T is the true 
value, and N m is the number of independent measure- 
ments (realizations) of this contribution. For the exam- 
ple of the a a and en, terms of the spectral fitting residual 
errors, there are only N m = N u x N v independent mea- 
surements as compared to N u X N v X N v for the EOR 
signal and the thermal noise. This correlation along the 
rj axis comes from using the frequency channels to make 
the original spectral fit, and means that these spectral 
errors will only average down by adding more lines of 
sight, not increasing the number of cells along the rj axis. 
The correlations of the various foregrounds are listed in 
Table H] 

In addition to the uncertainties in the amplitude de- 
terminations, we must also calculate the covariance of 
the parameters. Since the power spectrum templates of 
the residual fitting errors and signal form the basis func- 
tions for the parameter estimation in the residual error 
subtraction stage, they define the covariance of the am- 
plitude terms. Thus, for a given observatory and choice 
of foreground subtraction algorithms, we can calculate 



the residual foreground power spectrum templates and 
resulting uncertainty in the EOR measurement. 

4. IMPLICATIONS FOR ARRAY AND ALGORITHM 
DESIGN 

Since the power spectrum templates of the subtraction 
errors quantify the interactions between the three anal- 
ysis stages, they enable us to study the effects of array 
design on our ability to isolate the statistical EOR signal. 
The difference in the shapes of the power spectrum tem- 
plates determines how easy it is for the parameter estima- 
tion stage of the analysis to separate different contribu- 
tions. If the power spectrum shapes of two contributions 
are similar, it will be difficult to accurately determine 
the amplitudes of the contributions. Mathematically, the 
shapes of the power spectrum templates determine the 
covariance of the parameter estimation matrix, with the 
covariance decreasing as the shapes become more orthog- 
onal. 

The power spectrum templates depend on the details 
of both the array design and analysis technique. For ex- 
ample, the model fitting error template depends on both 
the angular resolution of the array (detailed pixel shape) 



TABLE 1 





Functional Dependence 


Statistical Correlations 




EOR Signal 


k 




N u x N v x N v 


Thermal Noise 


V« 2 + v' 2 




N u X N v x N v 


Spectral Fitting (cr a ,ai,) 


V 


V 


N u x N v 


Spectral Fitting (rj c i) 


s k (v) 




N u x N v 


Spectral Model 


V 


V 


N u x N v 


Source Subtraction 


u, v, r] 


u, v, r] 


-^sources X -/V spcc t r al parameters 



Note. — The table shows the functional dependence and statistical correlations of the signal and foreground components in the Fourier 
space (k or u,v,rj), and the number of independent measurements of the component's amplitude N m . The observed Fourier space has a 
fundamental correlation imprinted by the field-of-view and bandwidth of the observation, and th is can be used to define a natural cell-siz e 
for the data set where the observed EOR signal and thermal noise in each cell is independent ( Morales & Hewitt 2004; Morales 2005). 
However, the residual foreground components have additional statistical correlations between cells, as indicated in the table and described 
in the text. 



and whether a quadratic or logarithmic power law is used 
in the fit. The performance advantages and trade-offs of 
different arrays and analysis techniques is captured in the 
shapes and covariances of the power spectrum templates. 

To date, the experimental community has been uncer- 
tain as how to best design arrays and analysis systems to 
detect the statistical EOR signal. Much of this confusion 
is because no quantitative measure has been available for 
comparing design choices. We feel that the power spec- 
trum templates can provide the necessary figure of merit. 
The shape of the EOR power spectrum signal is known 
(given a theoretical model) , and so we are concerned with 
the amplitude and covariance of the foreground subtrac- 
tion errors of a given array and analysis system with the 
known EOR power spectrum. The power spectrum tem- 
plates define the performance of the array and analysis, 
and allows design trade-offs to be accurately compared. 
In many cases, making a plot analogous to Figure and 
comparing the amplitude and shape of the residual tem- 
plates will be sufficient to guide the array design. 

5. TOWARDS PRECISION FOREGROUND CALCULATIONS 

The residual foreground contamination levels shown in 
Figure are unrealistically small for the first genera- 
tion of EOR observatories. However, the EOR signal 
can still be detected even if the amplitude of the resid- 
ual foregrounds greatly exceeds the EOR signal in the 
Figure. The key question is not the amplitudes of the 
residual foregrounds, but the uncertainties they create 
in measuring the EOR signal, as discussed in Section 
13.51 The foregrounds shown in Figure [3] are what would 
be needed to detect the EOR power spectrum in a single 
pixel, whereas all of the first generation EOR observa- 
tories rely on combining information from many lines of 
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sight. 

Unfortunately, the uncertainty due to foreground con- 
tamination depends strongly on characteristics of the ar- 
ray and observing strategy, for example, a wide field ob- 
servation will average over more lines of sight and thus 
be able to tolerate higher standard deviation in the sub- 
traction than a narrow field observation. This precludes 
defining a set of amplitudes which must be obtained to 
observe the EOR signal for a generic observatory. 

The dependence of the subtraction precision on the de- 
tails of the array makes the task of foreground modelers 
much more difficult. The precision of the foreground re- 
moval is now array dependent, and most researchers are 
not familiar with the subtle array details needed to ac- 
curately calculate the sensitivity of a given observation. 

However, the power spectrum templates and a val- 
ues do offer a way of translating from the characteris- 
tics of a foreground removal algorithm to the sensitiv- 
ity of an array. Modelers can determine the shape of 
the residual foreground contamination (as in Equation 
EJ and the scaling of the a values for their foreground 
removal algorithm. Experimentalists can then use the 
predicted power spectrum shapes and er scalings to de- 
termine the effects of the foreground subtraction for a 
specific observation. This allows researchers studying 
the foreground removal to avoid doing detailed calcu- 
lations for each array and observing strategy, while still 
providing robust results which can guide the experimen- 
tal design of the next generation EOR observations. In 
this way, we hope the foreground removal framework pre- 
sented in this paper will facilitate a conversation between 
foreground modelers and experimentalists and enable ac- 
curate array-specific predictions of the foreground sub- 
traction effects on the up coming EOR observations. 
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